Optimal estimation of SU(d) using exact and approximate 2-designs 
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We consider the problem of estimating an SU(d) quantum operation when n copies of it are 
available at the same time. It is well known that, if one uses a separable state as the input for the 
unitaries, the optimal mean square error will decrease as 1/n. However it is shown here that, if a 
proper entangled state is used, the optimal mean square error will decrease at a 1/n 2 rate. It is 
also shown that spherical 2-designs (e.g. complete sets of mutually unbiased bases and symmetric 
informationally complete positive operator valued measures) can be used to design optimal input 
states. Although 2-designs are believed to exist for every dimension, this has not yet been proven. 
Therefore, we give an alternative input state based on approximate 2-designs which can be made 
arbitrarily close to optimal. It is shown that measurement strategies which are based on local 
operations and classical communication between the ancilla and the rest of the system can be 
optimal. 



I. INTRODUCTION 

The problem of estimating a completely unknown U G 
SU(d) unitary operation is studied in this paper. It is 
assumed that n copies of U are available. The idea is 
to prepare a suitable input state, use it as an input for 
jj®n an( j measu re the output. One could also allow for 
an ancilla, i.e., a part of the input state that is left un- 
touched. In addition to being interesting in itself, SU (d) 
estimation also has applications in the problem of opti- 
mal alignment of reference frames [J 0, d, B| • 

This problem has been considered from a Bayesian 
point of view for SU(2) in Refs. [1 S, @ and for a 
general SU(d) in Refs. [y, 0]. They study the case where 
each copy of U is used only once and obtain that the op- 
timal mean square error (MSE) goes to zero at the rate 
1/n 2 compared to the 1/n rate that would be obtained 
if no entanglement in the input state were allowed. Even 
though this problem is very interesting from a theoretical 
point of view, it is more likely that one does not have an 
arbitrary number of copies of a unitary gate, or that it 
is not yet practically feasible to create such a large en- 
tangled input state. Therefore, it would be more natural 
to assume that the number of copies n is fixed and to 
repeat the experiment a number N of times. Clearly the 
TV-dependence of the MSE will be of the from 1/N as it 
is the case in models of the form p 8Ar , so most of the 
effort will be on optimizing the n-dependence. Still one 
could compare the results obtained in this approach with 
the results obtained in 0, H 0, H, 0] • 

The authors of Ref . [8| , address the question of finding 
conditions for a general quantum operation to exhibit 
this behavior. 

To state the problem more precisely, let u> on C dA ® 
C d be the input state, where d A is the dimension of the 
ancilla. The output state p, also on C dA <g> (C d )®", then 
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becomes 



p= (l A ®U® n )uj(l A ® u® n )\ 



where 1^ is the identity operator on the ancilla. This 
output state is then measured and the outcome of the 
measurement is recorded. This process is repeated N 
times and from the measurement outcomes an estimate 
of U is made. The situation is represented in Fig. [1] 
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FIG. 1: n copies of U are available at the same time. 

In this paper, it is assumed that U is parametrized 
by 8 € R d , and an estimate 6 of 9 is made. These 
considerations allow one to formulate the model studied 
in this paper as 



p n {6,u)W = g, U(0f n Hl A ® U{9f 



mif 



(1) 



Obviously, if the input state u) is separable the situa- 
tion would basically be the same as having only one copy 
of U and repeating the experiment nN times. In that 
case, results of classical statistics imply that the MSE 
would behave as l/(nN). The question is whether by 
using an entangled state u> the dependence on n can be 
improved. As one might expect, the answer to this ques- 
tion is positive. 

In the rest of this paper it will be shown that, for 
SU(d), there is an input state, a measurement and an 
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estimator, such that their MSE vanishes at a l/(Nn 2 ) 
rate. We use the so called pointwise approach, which dif- 
fers from the (more commonly used) Bayesian approach. 
In the Bayesian approach, both the measurement strat- 
egy and the estimator are so chosen as to minimize the 
average of a loss function (often one minus fidelity) with 
respect to a given prior distribution for any N. In con- 
trast, in the pointwise approach one's goal is to optimize 
the performance of a scheme at a fixed point (the truth) 
in parameter space for large N. 

In section |n] some quantum statistical results and no- 
tation are introduced. The notion of spherical 2-designs, 
which will be useful for obtaining an optimal input state, 
is also defined. The figure of merit for evaluating the per- 
formance of the input state and the measurement used, 
is also specified. In section IIII1 the optimal input state 
for our figure of merit is found. The state is optimal in 
the sense that it minimizes a lower bound on the MSE 
of any measurement (the quantum Cramer-Rao bound) 
and also in the sense that it guarantees the existence of a 
measurement that achieves this bound. It is shown that 
optimal input states can be constructed from 2-designs 
(if they exist for dimension d). It is possible, however, 
that a construction of a 2-design is not known in C d (e.g. 
because they do not exist). We show in section ITVl that 
one can be arbitrarily close to optimal by using an ap- 
proximate version of a 2-design. In section [V] it is shown 
that there exists an optimal measurement strategy which 
can be performed by local operations and classical com- 
munication (LOCC) between the ancilla and the rest of 
the system. Unfortunately, the optimal measurements 
shown here have the problem of being a different one for 
different values of 9. This problem can be overcome in a 
two-step adaptive strategy like the one used in Ref. @. 
Alternatively one could use the so-called random mea- 
surement. This measurement can be described as fol- 
lows: at every repetition of the experiment, one chooses 
a basis uniformly at random and measures in this basis. 
The random measurement gives the same performance 
regardless of what the actual 9 is, but is only half as 
good as the optimal one, this is proven in the appendix. 



II. PRELIMINARIES 

In this section some notions and results needed for the 
rest of the paper will be introduced. 



an unbiased estimator 1 for 9 G C M. d _1 , the a! 2 — 1- 
dimensional parameter of a density matrix p. The MSE 
can then be written as the (d 2 — 1) x (d 2 — 1) matrix with 
elements 

V n 0,9,M t u,) l $ 
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The Fisher information (FI) can be calculated as the 
(d 2 — 1) x (d 2 — 1) matrix with elements 



[N] 



E 



\x\p n {9,u)\ If \M t 



[N] 



Me 



where f(9), a means partial derivative of f{9) with re- 
spect to 9 a (later one the notation d a f(9) will also be 
used), and Q+ = {£ e O : tr[p n (9, w) [JV] M ? ] > 0}. The 
Cramer-Rao bound (CRB) [Hj] states that 

V n {§,6,M,u)W > [I n (9,M,u>)W]-i. 

The previous equation should be interpreted as a matrix 
inequality, i.e., (V—I -1 ) is a positive semidefinite matrix. 

If one performs a measurement M which consists on 
repeating the same measurement m on each of the copies 
then the measurement results will be independent and 
identically distributed, (i.i.d.) and the Fisher informa- 
tion will satisfy 



I n (0,M i w)W = NI n (O,m,w) 



[i] 



(2) 



and it follows that 



V n (9,e,M,cj) 



[N] 



> 



[I n (0,m,w)W 
N 



It is a well known fact in mathematical statistics that 
(under some regularity conditions) the maximum likeli- 
hood estimator (MLE) in the limit of large TV is asymp- 
totically unbiased and saturates the Cramer-Rao bound 
[l(|- Moreover no other reasonable estimator (unbiased 
or not) can do better This means that it is enough 
to concentrate on finding a measurement that optimizes 
the Fisher information for N = 1 and use the MLE. 

The quantum Fisher information (QFI) is defined as 
the matrix with elements 

H n (9,w)W = tr[p n (9,u)WA n (9,u)W o X n (9,u J )f ] }, 

where {A n (0, w)i> ■ ■ ■ > ^n(9, w)p } are the symmetric 
logarithmic derivatives (SLD), and^loi? = (AB+BA)/2. 



Quantum statistics 



Let {M^ : £ £ fi}, be a positive operator valued mea- 
sure (POVM), where fl is the outcome space. Let 9^ be 



Unbiased means that the expectation of the estimator is equal 
to the truth, i.e., E8 a = J2(ea tr[p(0)M 5 ]e ?a = 6 a . 



3 



The SLD are defined as selfadjoint solutions to the equa- 
tion 



d a p n (9,w 



[ N ] _ Pn(9,u) [N] O X n (9,Uj) [ a 



[N] 



For pure state models the SLD can be chosen to be 
\ n (6,Lu) l a ] = 2d a p(6,uj)( N > n \ From the definition of the 
QFI, it is easy to derive that 

H n (0,cj)W=NH n (e,u)W, 

from now on H n (9, w)W will simply be denoted by 

The Fisher information of any measurement is upper 
bounded by the QFI [H, i.e., 

J„(0,M,w)M < H n (9,w)W = NH n (6,uj), VM, (3) 

this is the Braunstein and Caves information inequality 
(BCII). Furthermore if there is another real symmetric 
matrix 7 such that VM, I (9 , M , w)( A » < I, then it fol- 
lows that / > H n (9, , i.e., the inequality is sharp. 
The BCII together with the CRB give rise to the quan- 
tum Cramer- Rao bound (QCRB), 



(4) 



N 



The inequality given by Eq. ^ is in general not at- 
tainable: in general there is no measurement M such 
that I n (0,M,oj) [N] = H n (9,uj) [N] . Because of this, it 
turns out that it is not always possible to compare the 
FI of different measurements. One has to choose what 
one wants to estimate by assigning weights to the differ- 
ent parameters, i.e., minimize an expression of the form 
Tr GV n (0, 9, M, u>)W, where G is a real positive semidef- 
inite matrix, over all measurements, (reasonable) esti- 
mators and input states. Since the MLE asymptotically 
achieves equality in the CRB the problem can be reduced 
to minimizing Tr G[I(0, 9, M, a;)^'™)]" 1 over all measure- 
ments, and input states. From Eqs. @ and © it follows 
that the optimal FI 1(9, 8, M, uj)( n ^ ~ N. It is therefore 
meaningful to look at the quantity 

C n (9,Lu,G)= lim NmaxTr G[I n (9,M,uj) [N] ]-\ ( 5 ) 



from ([3]) one readily obtains that 

C n (0,uj,G) >TTG[H n {9^)]-\ 



(6) 



For pure state models (in our case when ui = Jf2)(f2| 
for some |0) £ C dA <g> C d "), it has been shown []]| that 
([3]) is attainable if and only if 



Imtr 



= 0. 



(7) 



In that case the bound can be attained by independently 
performing the following measurement at each repetition 

m£ = \me){mt\ £ € {1, . . . , d 2 l}, 

(8) 



m d 2 +1 = 1 - ^m ? , 



where 



xh 



x=i 

d 2 -i 



\b a ) = Y,lHn(8,w)}jK(9,Lo) [ p l llA®U(9)® n }\n), 
0=1 

M = [l A ®U(6)*"]\Sl), 

where o a d 2 x d 2 real orthogonal matrix satisfying o^^ 2 
0. 

The previous measurement has the drawback of de- 
pending on 9, the actual value of the parameter, which is 
what one wants to estimate. This problem can be over- 
come by using a two step adaptive strategy like the one 
used in Ref. 0. One spends \/~N of the repetitions in 
finding a rough estimate 9\ of 9 using any information- 
ally complete measurement. Then the measurement (jSJ) 
is performed on the rest of the copies as if the truth were 
0\. This is also optimal 



B. 2-designs, MUBs and SIC-POVMs 

We will need the notion of mutually unbiased bases 
(MUBs), which was introduced in [la ]. The following 
definition closely follows the one given in 16]. 

Definition II. 1 (MUBs). Let By = {|<#>, . . . , |<^)} and 
B2 = {|<^i)> • • • > l^d)} be two orthonormal bases in C d . 
They are said to be mutually unbiased if and only if 
\(4>\ 1^)1 = l/Vd, for every i,j = l,...,d. A set 
{Bi, . . . , B m } of orthonormal bases in C d is called a set of 
mutually unbiased bases if each pair of bases is mutually 
unbiased. 

In any dimension d, the number of mutually unbiased 
bases is at most d + 1 [r|. Explicit constructions are 
known if d is a prime power [l5l. Il6j. Unfortunately not 
very much is known in other dimensions, for example, it 
is still an open problem whether there exists a set of 7 
MUBs in dimension d = 6. 

The notion of symmetric informationally complete 
POVMs (SIC-POVMs) [13 will also be useful. 

Definition II.2 (SIC-POVMs). Let {\ X i), \Xd 2 }} be 
a set of state vectors in C d satisfying \ (Xi\Xj)\ = (<^+l) 1 
for every i =/= j . Then 



IxiXxi 



\xd 2 ){xdA 
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is called a SIC-POVM. 

The fact that this actually is an informationally com- 
plete POVM follows from this definition [l7|. They have 
been shown to exist for d £ |2, 3, 4, 5, 6, 81 and are con- 
jectured to exist in all dimensions (17L Ha, Il9|. 

It is easy to check that MUBs and SIC-POVMs satisfy 
the following property: 

, d+l d tt(2,c0 



d(d + l) 



b=l i=l 



n 



(2,d) 



d(d + l)' 



where 11^'^ is a projector onto the completely symmet- 
ric subspace of C d <X> C d . Indeed, one can straightfor- 
wardly check that the Hilbcrt-Schmidt distance between 
the right hand side and the left hand side is, in both 
cases, zero. 

It was shown (2(| that any set of vectors satisfying this 
property, forms a spherical 2-design. More precisely, if a 
set of state vectors {|ti), . . . , |r m )} in C d form a 2-design, 
then they satisfy 



i — l 



1)' 



(9) 



For a formal definition of 2-designs, see for example Refs. 

MM. 



C. A Chernoff bound for matrix valued random 
variables 

The following result, due to Ahlswede and Winter [2l[ 
will be useful when dealing with approximate 2-designs. 

Theorem II. 3 (Ahlswede and Winter). Let X\, . . . , X m 
be p x p i.i.d. selfadjoint random variables satisfying < 
X b < I, EX b = M > pi and < e < 1/2. Then 



Pr 



1 



6=1 



X h - M 



> eM 



< 2pexp 



41n2 ? 



.(10) 



III. OPTIMAL INPUT STATE 

From Eq. ^ it is apparent that an optimal input state 
u> is one that minimizes Tr G[H n (8, w)] _1 - However it is 
possible that even if one minimizes this quantity, there is 
no measurement that achieves equality in ([5]). Moreover 
the problem of minimizing Tr G[H n (9, ui)]^ 1 for a general 
G is very hard, therefore we will concentrate in the case 
G = I, the (d 2 - 1) x (d 2 - 1) identity matrix. With 
this choice of G, it will be shown that it is possible to 
minimize Tt[H ti (0, w)] _1 and at the same time, guarantee 



the existence of a measurement that achieves equality in 
Eq. ©. 

Since from now on we work with N = 1, we will write 
(n) instead of (l,n). Also the dependance on 8 and u> 
will be omitted most of the times. 

Since the QFI is convex [22j , the search for an optimal 
input state can be restricted to pure states. Let |f2) € 
C dA ® C d be the input state, the output density matrix 
is 



[1a® u® n ]\n)(n\[i A ®u® n ]i. 



Let us define p Y as the average one-copy reduced den- 
sity matrix of p, i.e., 



Pi 



1 ™ 



where tr§ means partial trace with respect to all copies 
except the s th one. In the same way, let us define p 2 as the 
average symmetrized two-copy reduced density matrix of 
P, i-e-, 



Pi 



1 " 
n(n - 1) ^ 



tTsrp + W(tTsr P )W 



where tr^p means partial trace with respect to all copies 
except the r th and the s , and W is the swap opera- 
tor W : \i/})\<t>) h-> \(f>)\tp) for all \ip),\<j>) & C d . W can 
be expressed as W = J2 kl \kl)(lk\ where {|fc)} is an or- 
thonormal basis of C d . 

Lemma III.l. H n is given by 

Hna/3 =4n (Retr[oJBi*a*^] + (n - 1) tr [uJB 2 (*a <8> ^J-qn 
- ntr[aJg' 1 i C( ] tr[aJBit/3]) , 

where t a = iU'U <a and lob = tr C d A w. 

Note that H n depends only on ujb 2 an d that it will 
scale at most like n 2 . 



Proof. In this model the SLDs are 

A^ l) =2[(i A <g> d a u® n )\n)(n\(i A ® 

+ (1a <8 £/ 8 ")|f>)<f>|(iA ® s Q ^")]t, 

so that 

Lg=4[<n|i A ®TWrW|n> 

- (niiAsrWinxniiAST^in)], 

where L<§ = tr A^A^ , 



(12) 



E 1 ' 



( S -i) 



}(n— s) 



€ Su(cT) 
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and t a = iWU tCt £ su(d). 
Let 



A/ 



where (IV'k)) ^ s a system of orthonormal vectors in 
(respectively C d ®™), then (fT2]) may be rewritten as 



r(n) _ 
where 



AT 



A" 



Now, (|12|) may be rewritten as 

L aP =4n(tr[D — 1 £ Q ,ig] + (n - 1) tr [w^ 2 (*a ® 
- ntr[cJ^it Q ] trfaT - -^]), 



and the QFI is H na p = ReL 



(„) 



□ 



The following lemma examines the conditions for which 
equality can be achieved in Eq. ([3]). 

Lemma III. 2. There exists a measurement that achieves 
equality in the BCII (Eq. if and only if 



WB2 = 



10 1 



J 2 



2J h a pt a tf, 

a/3 



where h a /3 = hp a . 

Proof. Obviously, ub 2 is supported in the symmetric sub- 
space of C d 0C d . The most general state in the symmetric 
subspace can be written as 

1(g)! v^, r* , 

WB2 =—3 V 2_^b a [l ® t a + t a 



d 2 

a 

f 2J ^a^*a i/3, 



1] 



(13) 



a/3 



where h a p — hp a , lobi is then 



LUB 



1 \ , 

1 = 3 +d^b a t a . 



The condition reduces to 

tr(S?B 1 [t a ,* / j])=0 1 Va > j8. 

Since {tx, ■ ■ ■ , id 2 -i} span su(d), the Lie algebra of 
SU(d), the previous equation implies that for any r, s € 
su(d), tr(wBi[r, s]) = 0. Furthermore, since any t £ su(d) 
can be written as the commutator of two other su(d) ele- 
ments, we have that for all t £ su(d), tr (STsjt) = which 
in turn implies that TJbx = t/d or b a = 0. Therefore, 
LUB2 must be of the form 



1(81 v^T 



a/3 



□ 



From here on, the parametrization will be chosen in 
such a way that tri Q i^ = 5 a p, this allows one to express 
W as 

d 2 -i 

W = — — + 2^ *a®ta- 
a— 1 

The following lemma deals with minimizing Tr[if n ] — . 
It turns out that the input states that minimize this 
quantity also satisfy the conditions of lemma ITU. 2 1 so that 
this minimum value can also be attained. 

Lemma III. 3. Any input state u satisfies 

d{d+l) 2 {d-l) 



Mn + d) 

with equality if and only if 

10 1 1 x - 



(14) 



^B2 



d(d+l) 



d(d+l) 



-n 



(2,d) 



(15) 



Proof. The trace of -ff„ for the most general symmetric 
Z d C d (1131) can be written as 



U) B 2 on 

Trff n = 4n[trw Bl J^t* + (n - 1) Trft - nd 2 ^b 2 a ], 

a a 

where h = [h a p]. The operator ^ Q t 2 a is a Casimir oper- 
ator and therefore proportional to the identity, the pro- 
portionality factor can be found to be (d 2 — l)/d by taking 
the trace, then 

d 2 - 1 ~ 
Tr H n = 4n[— — + (n - 1) Tr ft, - nd 2 ^ b 2 J. 

a 

The trace of h can be easily found 

Tr ft, = h aa — trfw - 2 E ^ a ® 



1 



1 



= trw B2 VK -3 = 1-3, 
a a 

where we have used that wg 2 is supported in the sym- 
metric subspace of C d C d . Therefore the trace of i?„ 
satisfies 



Tr H„ = An 



< An 



- + (n-l)Tih-d 2 Y / b l 



d 2 -I , d-1 
— _ + ( n -l — - 



d- 1 , 
= A—j—n(n + d), 



with equality if and only if b a — 0. Using the Cauchy- 
Schwarz inequality and the previous equation one gets 



6 



that 



Trf^r 172 ^] 1 



/2 



<Tr[H n ]Tr([H n ]- 1 ) 



< 4- 



-n{n + d)C n {9,L0,T), 



which implies (|14[) . Equality is attained if and only if H n 
is proportional to [-ffn] -1 and 6 Q =0. H n is proportional 
to [Hn]^ 1 if and only if it is proportional to the identity 

which happens if and only if h is proportional to the 
identity. Therefore we have that the optimal h is 

^ = dfJ+i) 1, 

and the optimal Ub2 is given by (fl~5 
sponding to this state is 

n(n + d) 



d(d+l) 



The QFI corre- 

(16) 
□ 



Next one needs to find an input state |f2) such that 
(fl~5| holds. In the dimension where 2-designs exist, they 
can also be used to construct input states that satisfy 
(fTS"]) . Indeed it is easy to check that if the vectors 



(In),.. 



\i~m)} form a 2-design then the state 



|fi) 



(17) 



satisfies ([15)) . In particular, the dimension of the ancilla 
would be d A = d 2 if a SIC-POVM is used, and d A = 
d(d + 1) if a set of d + 1 MUBs is used. 

We now have an input state (given by (fTT)) ) that sat- 
isfies (|15[) and therefore is optimal in the sense of lemma 
IIII.3I However, we still need to check that it satis- 
fies one more condition: there should be a one to one 
correspondence between unitaries U and output states 
[1a <8> ?7® n ]|f2), this is proven in the following lemma. 

Lemma III. 4. The input states given by |J7[ ) satisfies 

\(n\[i A ®u® n ]i[i A ®u® n ]\n)\ = i 

if and only if U\lJ2 is proportional to the identity, i.e., 
they can only differ by a multiplicative phase. 

Proof. Let U = U\lJ2 be diagonalized as 



U 



d 
k=l 



Uk)(Uk\ 



For the input state (fl"7|) . we have that 



\({i\[i A ®u® n Vii A ®u® n ]\n) \ = - 



£< 



1 rn 



< i. 



One of the conditions for equality is that |(t,|J7|t,)| = 1 
for all i G {1, . . . , m}, i.e., (rj|l/|Ti) = e 1 ^. We have then 
that 



(Ti\U\n 



fe=i 



which implies that for all « G {1, ...,m} and fc € 
{1, . . .,d} either (fc|r. ( ) = or e"' fc = e*. Next we will 
prove that for every k ^ I E {l,...,d} there exists an 
iG {1, . . . , m} such that both (fe|r») ^ and (Z|Ti) 7^ 0, 
this would imply that e I?)fc = e lr,! = e l ^ i which would 
finish the proof. Indeed, we have that 



|fc0 = (w| 



■n 



(2,d) 



|feZ), 



or 



1 in 

-J2\( k \n)(l\n)\ 2 = 



1 



d(d+l)' 



wich implies that there must exist at least one i such that 
|<fe|ri)(I|ri)| > 0. □ 

It is now possible to state the main theorem. 

Theorem III. 5. The input state given by satisfies 



1. The map U ^ [l A ®U® n ]\n) from SU{d) to C dA ( 
(C d )®" is infective. 



C n (0,uj,l) = 
which is optimal. 



d(d+l) 2 {d-l) 
An{n + d) 



Proof. Point[T]is proven in lemma HlI.4l Point[2]is proven 
by lemma Hll.31 and the fact that the input state given by 
(fTT)) satisfies Eq. JISJ). □ 

All the main ingredients for the optimal estimation 
of n copies of a SU (d) quantum operation have been 
proven. As input state one can choose (fT7|) if they exist 
for dimension d. The used ancilla has dimension d A ~ 
d 2 .This input state is optimal as proven in theorem lIII.5l 
The output states are measured using the recipe given by 
([8]), data are collected and an estimate of the parameter 
is given by using the MLE. 

The case where there is no known construction of a 
2-design in C d is dealt with next. 



IV. APPROXIMATE 2-DESIGNS 

Let {Ui, . . . , U m } be an i.i.d. sequence of unitaries cho- 
sen uniformly at random from the Haar measure. Let 



d 



>md 



^^\bk)®[U b 



(18) 



fe=i 6=1 



7 



and u> — |f2)(f2|. For this choice we have that 

dm 



wax 



md 
1 



fc=l 6=1 



which ensures that equality can be achieved in the BCII 
(Eq. ([3|)). It is also easy to check that 



E L0B2 



d(d+l) 



-n 



(2,d) 



Var w B2 = — -E u B2 , 

TOGl 

i.e. the larger to is, the closer a7g 2 will be to satisfying 
([9]). This is why we call them approximate 2-designs. 

Using (fTTj) one can calculate the QFI corresponding to 
the input state (JTSJ 

H n (Ui, . . . ,U m ) = -J2h n (U b ), 



6=1 



where 

h n {U)al3 —~T [f>aP 



-(n-l)^{k\Uh a U\k){k\Uk f3 U\k) 



fc=i 



Lemma IV. 1. Let 6e t/ie optimal QFI f!6\) , then if 



4(d+l)ln2 , 
7Ji > - In 



2(d 2 - 1) 



we have that 

(l-e)m- 1 <H- x <{l + e)[H^\-\ 
holds with probability at least q. 
Proof. The strategy is to apply theorem III. 31 to 
[Hl\-V*h n {U h )[H°]-^ 



(19) 



X,, 



-, be{l,...,m}. 



d+1 

Claim IV. 2. X b < I. 

Proof of claim \IV.S\ This will be done by showing that 
K{U) <(d+ 1)H° for all U. Indeed, let igR^be 
a unit vector, and i = ^ a x a i Q , we have that 

x T h n (U)x = };X a h n (U) a i3Xf3 

a/3 

in 



l + (n-l)53(ifc|l7 t tC/'|A) s 



fe=i 



< 



d 

~d 

4ro 2 

— <(d+1) -d(rf+l) 
(d+l)a; T ff°a;. 



l + (n-l)^|(fc|^[/|0| : 

kl 

4n(n + d) 



Where we use the fact that trt 2 = 1 and that n < (n+d). 

,2 -I 

Since the above equation holds for any x G R we have 
K{U) < (d + 1)H° as desired. □ 

Next, we need the expectation of h n {u). 
Claim IV.3. 



lX h 



d+1 



Proof of claim \IV.3[ It suffices to prove that E/i„ (U) — 
H°. 



in 
~d 



Eh n {U) af3 = / h n (U) af3 dU 
-{n - 1) J2 tr ([*« ® J {U\k)(k\uT 2 dU 



in 
~d 

in 
~d 



— r-r° 

— n nal3- 



2(n- 1) 



d(d+l)^ 
(n-1) 



^X>([t a ®t*]n? , '° 



d+l 



□ 



We can now apply theorem III. 31 with p = d 2 — 1 and 
(i = l/(d+ 1) to get 



Pr 



1 m 
TO * — ' 



6=1 



d+l 



> e 



d+l 



< 2(d 2 - l)exp 

or in terms of H n 

Pr [\H n - H° n \ > eH° n ] 

< 2(d 2 - 1) exp 



4(d+ l)ln2 



e 2 TO 



4(d+ l)ln2 



□ 



The statement of the lemma follows immediately from 
the previous equation. 



Corollary IV.4. If 

4(d + l)ln2 



TO > 



In [2(d 2 - 1)] , 



then there exists a choice {Ui,...,U m } such that U9\) 
holds. This implies that using these unitaries in the input 
state fTl 



d(d+l) 2 (d- 1) 
4n(n + d) 



C n (9,u,I) < (1 + e) 



8 



Comparing this upper bound with the lower bound from 
lemma Un. 31 one can see that by choosing m large enough, 
one can be arbitrarily close to optimality. 



We also need to prove an equivalent of lemma IIII.41 
i.e., that a state of the form (fTg)) also gives a one to one 
correspondence between input states and unitaries. 

Lemma IV. 5. A state of the form H8\) also gives a one 
to one correspondence between input states and unitaries 
with probability 1. 



Proof. As in lemma lTlL4l it suffices to show that | (1a® 
I7® n )|f2) | = 1 holds if and only if U is proportional to the 
identity. 



|<0|(1, 



l )l^)l = ^ 
ma 



J2(k\uluu b \ky 



hi, 



< 



1 Y,\(k\UtUU b \k)r 



md 



l-b 



< 1, 



with equality only if | (k\Ul UUb\k) \ = 1 for all k and b. 
In particular, this means that for every b, {\k)} is a basis 
of eigenvectors of UlUUb, i.e., 



UluU b =J2^ k \k){k\, 

k=l 



U = Y,^ k Ub\k){k\ul 



k=l 



Take now b ^ b' , we have that 



e*** = (k\uluU b \k) = J2e^\{k\utu b ,\l)\ 2 . (20) 



i=i 

Since U b and U b > where choosen unifornly at random, 
it is true that with probability 1, \(k\ulu b >\l)\ 2 > for 
all I. This, together with Eq. ([2H)l . immediately implies 
that for all I, <pi = 4>k which in turn means that U is 
proportional to the identity as desired. □ 



V. OTHER MEASUREMENT STRATEGIES 
(LOCC AND RANDOM) 

A. LOCC measurements 



the rest of the system. If the input state is of the type 
(jTTJ) Bob's reduced state is 

m 

PB = -J2W T r)( T r\UT n , 
r=l 

a separable state. This means that Bob's optimal esti- 
mation strategy will have an MSE which depends on n 
as 1/n at best. If Alice sends the ancilla to Bob, he will 
be able to achieve the 1/n 2 rate. In the case of MUBs 
and SIC-POVMs the ancilla will be small, its dimension 
is of the order d 2 , i.e., independent of n. 

It is also interesting to ask what happens if Alice and 
Bob can exchange classical information. Consider the 
following simple LOCC measurement: Alice performs the 
measurement with components A r = \r)(r\ on the ancilla 
and then sends the outcome to Bob. With this informa- 
tion Bob's state becomes 



PB\ r =[U\r r )(r r \U^ 



0)1 



which is also a product state so its Fisher information 
given r will behave as n. The total Fisher information 
will be the average of the Fisher informations for fixed 
r and so will also behave as n and the MSE as 1/n. Of 
course, this is a very special LOCC measurement, it turns 
out that, at least in the dimensions where there exist d+l 
MUBs, there exists an LOCC strategy which is optimal. 

Lemma V.l. If there exists a set of d+l MUBs, then the 
bound |J^[) can be attained using an LOCC measurement. 



Proof. The lemma is proven by showing such a strategy. 
The output state is 



d+l d 



==^^i&fc)®[[/|^)p». 



Vdjd + l) 



b=l k=l 



Alice measures performs in the ancilla the measurement 
with elements A b k = \b)(b\ <8> |/fc)(/fc|, where 



\h) 



1 d 



Vd 



i=i 



2mkl 



10, 



is the Fourier transform of the basis She obtains 

outcomes b, k with probability [d{d + l)]~ x and commu- 
nicates her outcome to Bob. In that case, Bob's state 
becomes 



)B\bk 



1 

7^ 



exp 



i=i 



2itikl 



(21) 



It is interesting to see how the problem changes if 
the type of measurements that can be performed is re- 
stricted. Suppose, for example, that the measurement is 
performed by two parties, Alice and Bob. Suppose also, 
that Alice has access only to the ancilla and Bob only to 



He should still perform a measurement on this state. The 
Fisher information of this procedure is 

d(d + l)^ ' 



9 



where 1^ is the Fisher information of Bob's measure- 
ment on the state (|2"Tj) . The QFI for the state (JUJ) can 
be calculated using Eq. (fTTjl 



„[Wc] _ 4n 



S a0 + (n-l)J2(ti\t«\<l>1){tf\t 



Furthermore, the condition ([7]) is satisfied therefore there 
exists a measurement (e.g. the measurement given by ([5])) 
which achieves equality between the QFI and the FI. Us- 
ing the fact that the d + 1 MUBs from a 2-design, one 
can check that 



d(d 



1 yum 
+ i) ^ 

' bk 



n(n + d) 
djd+l) ' 



which is exactly the optimal value |T 



□ 

Remark V.2. This proof can be easily adapted to show 
that for an input state of the form H8\) there exists an 
LOCC measurement such that there is equality between 
the FI and the QFI corresponding to that state. 

This result appears to contradict Ref. [23| where it was 
shown that for n — 1, optimal collective measurements 
were at least 2(d+l)/d times better than any LOCC mea- 
surement. However there is no contradiction. In contrast 
with the present work, in Ref. [23] the ancilla had di- 
mension d and the state used was a maximally entangled 
state. This was enough to obtain the optimal QFI and 
to guarantee the existence of a measurement that attains 
the QCRB. Once this input state is fixed one obtains the 
mentioned advantage of optimal collective measurements 
over LOCC ones. Here it is shown that by allowing a 
larger ancilla, one can still be optimal, and LOCC mea- 
surements can perform as well as collective ones. 



B. The random measurement 

The only perhaps not so desirable feature of the mea- 
surements strategies described so far, is that they make 
use of the recipe given by ([5]). This recipe gives a dif- 
ferent measurement for different values of the parame- 
ter, one may need to use an adaptive strategy like the 
one described at the end of section III Al This undesired 
feature can be easily avoided at the cost of being subop- 
timal by using the so-called random measurement. The 
random measurement can be described in the following 
way: at every repetition choose an orthonormal basis of 
C dA ® (C d )® n and measure on that basis. 

Lemma V.3. The random measurement M r achieves a 
FI which is half the QFI, ie., 

7^,M r)W ) = ^#^=2^±^I, (22) 



d(d+l) 



that 



One would achieve the same if one modifies the LOCC 
strategy from lemma IVTT1 such that Alice does the same 
but Bob performs the random measurement on his part 
of the system ([C rf ]®"). 

A general proof of (|2"2"|) will be given in the appendix. 



VI. DISCUSSION 

We have found an estimation strategy with an MSE 
that behaves like 

This gives us a hint on how to tackle the problem in which 
one is allowed to use every copy of U only once and n is 
not fixed. The strategy would be to divide the n copies 
into n e groups of n 1_e copies each where e is an arbitrary 
small but strictly positive real number. Then for each of 
the n e groups, one would perform the optimal strategy 
described above in this paper independently. With this 
procedure, one would expect the MSE to behave as 



MSE(0,0,M) 



d(d + r 



I + o(l/n 2 - £ ), 



i.e., as close to the 1/n 2 rate as one wants. However, in 
this situation for each n the model is a different one, so 
we would not be in the familiar i.i.d. case either. 



VII. CONCLUSIONS 

We have considered the problem of estimating an 
SU(d) operation when a fixed number, n, of copies is 
available. By allowing entanglement in the input state, 
we have found an optimal estimation strategy where the 
MSE vanishes at a l/(7Vn 2 ) rate, where N is the (large) 
number of times the experiment is repeated. This is much 
better than the l/(Nn) rate that one would obtain if no 
entanglement in the input state were allowed. We have 
shown that the optimal input states can be constructed 
from 2-designs, if they exist for dimension d, otherwise 
from approximate versions of them. In both cases, these 
input states have another interesting property: if one has 
no access to the ancilla, the reduced state is separable 
and thus the MSE will behave as l/(Nn) at best. Wc 
have also shown that, if a set of d + 1 MUBs exists or if 
one uses an approximate 2-design, classical information 
about the ancilla is actually enough to achieve equality 
in the Braunstein and Caves information inequality. In 
particular this means that in the former case one can be 
optimal with LOCC measurements. The optimal mea- 
surements found here are adaptive ones. It was shown 
that the so-called random measurement can be used to 
avoid this at the cost of being suboptimal. It would be 
interesting to find a non-adaptive measurement which is 
optimal for all values of the parameter. 
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APPENDIX A: THE RANDOM MEASUREMENT 

Suppose we have any pure state model on a d- 
dimensional Hilbert space Ti. (i.e. the number of param- 
eters p can be anything between 1 and 2(d — 1)). It will 
be proven that the random measurement M r , which con- 
sists of choosing a basis of TL uniformly at random and 
then measuring on that basis, achieves 



I(6,M r ) = ±H(e), 



(Al) 



in particular, this measurement would be optimal when 
p = 2(d— 1) if asymptotic fidelity is taken as the figure of 
merit. This measurement has also been studied in [24(. 

Let and {Ai, . . . , A p }, be the state vector and SLDs 
at the true value of the parameter respectively, and let 
be any basis of TL. It is an easy exercise to show 

that 



I(9,{\k)(k\}) af3 = l -H(9) aP 



1 d 



fc=l 



(k\l a )(k\l 



(iP\k) 



where \l a ) — Xalip}- Let C be the subspace of TL spanned 
by . . • , \l p )} and 1Z be its orthogonal complement, 
let lc and In be projectors onto C and 7Z respectively. 
Let the unitary operator Y be defined as 



Y=t 



n 



■He- 



One should keep in mind that V depends on the real 
value of the parameter, in particular, Y\ip) — \tp) and 
Y\l a ) — i\l a ). Clearly {Y"|fc)} is a new basis and it is 
easy to see that 



I(0,{Y\k)(k\Yi}) af3 = ±H(9U 



1 * 

o£ Re 



fe=i 



(k\l a )(k\l p \ 



(V| fe) 



(*#> 



Let M be the measurement with elements 

[i|i)(l|,...,i|d)(d|,iF|l)(i|Ft,...,iy|d)(d|yt 



1(8, M) = \ [l(6,{\k)(k\}) + I(9,{Y\k)(k\Y^})] 



Since {\k)} is any basis, also the measurement Mjj with 
elements 



lu\i){i\u\...,lu\d){d\u\ 



l -YU\l)(l\U^Y\..., l -YU\d)(d\U^Y 

where U is any unitary will also satisfy 

I{B,M v ) = \h{9). 

It is not hard to see that choosing U at random and 
performing M\j is the random measurement and there- 
fore (|Alj) must hold. This can also be shown analytically. 
The Fisher information for the random measurement is 

1(6, Mr) = [ fi(dg)I(e,{U g \k)(k\Ul}) 



- J v(Ag)I(e,{U g \k)(k\Ul}) 
\ f fi(dg')I(0,{U g '\k)(k\ul,}), 



where fj, is the normalized Haar measure and the integrals 
are over SU(d). Now let g' = hg and Uh — Y, we get 

1(9, M r ) J- J n(dg)I({9,U g \k)(k\Ul}) 

\(dg)I(9,{YU g \k)(k\UlY^}) 
=~ J n(dg)H(9) = l -H(9), 

where we have used that 

I(9,{U g \k)(k\Ul}) + I(9,{YU g \k)(k\UlY^}) = H(9). 

In the case studied here, the random measurement 
would consist of choosing a basis of C dA ® (C d )®" uni- 
formly at random (with respect to the normalized Haar 
measure) and then measuring on that basis. Such a mea- 
surement is, as it was mentioned before, independent of 
the parameter and achieves 
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